Skip to content

Add cortex_m MVE/Helium int16 quantize/dequantize support (#19218)#19218

Merged
meta-codesync[bot] merged 1 commit intomainfrom
export-D103129855
Apr 30, 2026
Merged

Add cortex_m MVE/Helium int16 quantize/dequantize support (#19218)#19218
meta-codesync[bot] merged 1 commit intomainfrom
export-D103129855

Conversation

@3l1
Copy link
Copy Markdown
Contributor

@3l1 3l1 commented Apr 30, 2026

Summary:

Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback.

Differential Revision: D103129855

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell

@3l1 3l1 requested a review from rascani as a code owner April 30, 2026 03:18
@pytorch-bot
Copy link
Copy Markdown

pytorch-bot Bot commented Apr 30, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19218

Note: Links to docs will display an error until the docs builds have been completed.

⏳ 1 Pending, 2 Unrelated Failures

As of commit f3fe5c6 with merge base e84a418 (image):

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 30, 2026
@meta-codesync
Copy link
Copy Markdown
Contributor

meta-codesync Bot commented Apr 30, 2026

@3l1 has exported this pull request. If you are a Meta employee, you can view the originating Diff in D103129855.

@github-actions
Copy link
Copy Markdown

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

@3l1 3l1 requested a review from digantdesai April 30, 2026 03:21
@3l1 3l1 added ciflow/trunk module: arm Issues related to arm backend labels Apr 30, 2026
@meta-codesync meta-codesync Bot changed the title cortex_m int16 quantize/dequantize MVE Add cortex_m MVE/Helium int16 quantize/dequantize support (#19218) Apr 30, 2026
meta-codesync Bot pushed a commit that referenced this pull request Apr 30, 2026
Summary:

Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, plus the AoT partitioner update so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback (no MVE on Cortex-M55, since the portable path is gated on __aarch64__ || __ARM_NEON__ which Cortex-M55 has neither of).

Differential Revision: D103129855
@meta-codesync meta-codesync Bot force-pushed the export-D103129855 branch from 6b9ce08 to a27f25f Compare April 30, 2026 03:49
meta-codesync Bot pushed a commit that referenced this pull request Apr 30, 2026
Summary:

Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, plus the AoT partitioner update so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback.

Differential Revision: D103129855
@meta-codesync meta-codesync Bot force-pushed the export-D103129855 branch from a27f25f to 89e6708 Compare April 30, 2026 03:50
meta-codesync Bot pushed a commit that referenced this pull request Apr 30, 2026
Summary:

Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback.

Differential Revision: D103129855
@meta-codesync meta-codesync Bot force-pushed the export-D103129855 branch from 89e6708 to 5cea79a Compare April 30, 2026 03:51
Summary:

Adds MVE-vectorized int16 (Short) paths to the Cortex-M ExecuTorch quant ops, so the ARM partitioner rewrites int16 quantized_decomposed::* to cortex_m::*. Previously cortex_m only handled int8 — int16 quant ops fell through to the portable scalar fallback.

Differential Revision: D103129855
@meta-codesync meta-codesync Bot force-pushed the export-D103129855 branch from 5cea79a to f3fe5c6 Compare April 30, 2026 18:07
@3l1 3l1 added the partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm label Apr 30, 2026
@meta-codesync meta-codesync Bot merged commit 94d2881 into main Apr 30, 2026
599 of 609 checks passed
@meta-codesync meta-codesync Bot deleted the export-D103129855 branch April 30, 2026 23:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/trunk CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported meta-exported module: arm Issues related to arm backend partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants